Topical and Non-Topical Approaches to Measure Similarity between Arabic Questions

نویسندگان

چکیده

Questions are crucial expressions in any language. Many Natural Language Processing (NLP) or Understanding (NLU) applications, such as question-answering computer systems, automatic chatting apps (chatbots), digital virtual assistants, and opinion mining, can benefit from accurately identifying similar questions an effective manner. We detail methods for similarities between Arabic that have been posted online by Internet users organizations. Our novel approach uses a non-topical rule-based methodology topical information (textual similarity, lexical semantic similarity) to determine if pair of similarly paraphrased. method counts the linguistic distances each question. Additionally, it identifies accordance with their format scope using expert hypotheses (rules) experimentally shown be useful practical. Even there is high degree similarity When question (Timex Factoid—inquiring about time) Who inquiry (Enamex Factoid—asking named entity), they will not similar. In experiment 2200 pairs, our attained accuracy 0.85, which remarkable given simplicity solution fact we did employ language models word embedding. order cover common queries presented users, gathered various forums resources. this study, describe unique detecting does require intensive processing, sizable corpus, costly repository. Because many rich textual resources, especially important informal text processing on Internet.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Distinguishing between Topical and Non-Topical Information Diffusion Mechanisms in Social Media

A number of recent studies of information diffusion in social media, both empirical and theoretical, have been inspired by viral propagation models derived from epidemiology. These studies model the propagation of memes, i.e., pieces of information, between users in a social network similarly to the way diseases spread in human society. Importantly, one would expect a meme to spread in a social...

متن کامل

Analgesic synergy between topical lidocaine and topical opioids.

Topical drugs avoid many of the problematic side effects of systemic agents. Immersion of the tail of a mouse into a solution of dimethyl sulfoxide (DMSO)-containing morphine produces a dose-dependent, naloxone-sensitive, analgesia (ED(50) 6.1 mM; CL 4.3, 8.4) limited to the portion of the tail exposed to the drug. DMSO alone in this paradigm had no analgesic activity. Like morphine, the opioid...

متن کامل

Topical tags vs non-topical tags: Towards a bipartite classification?

In this paper we investigate whether it is possible to create a computational approach that allows us to distinguish topical tags (i.e. talking about the topic of a resource) and non-topical tags (i.e. describing aspects of a resource that are not related to its topic) in folksonomies, in a way that correlates with humans. Towards this goal, we collected 21 million tags (1.2 million unique term...

متن کامل

Current topical and systemic approaches to treatment of rosacea.

Rosacea is a common, often overlooked, chronic facial dermatosis characterized by intermittent periods of exacerbation and remission. Clinical subtypes and grading of the disease have been defined in the literature. On the basis of a genetic predisposition, there are several intrinsic and extrinsic factors possibly correlating with the phenotypic expression of the disease. Although rosacea cann...

متن کامل

Approaches to Measure Chemical Similarity ± a Review

Although the concept of similarity is a convenient for humans, a formal definition of similarity between chemical compounds is needed to enable automatic decision-making. The objective of similarity measures in toxicology and drug design is to allow assessment of chemical activities. The ideal similarity measure should be relevant to the activity of interest. The relevance could be established ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Big data and cognitive computing

سال: 2022

ISSN: ['2504-2289']

DOI: https://doi.org/10.3390/bdcc6030087